FIELD OF THE INVENTION 

This invention is related to novel compounds derived from peptide nucleic acids that 
bind to complementary DNA and RNA strands. In particular, the invention is concerned 
with peptide nucleic acid analogs that are chiral, positively charged, more soluble in 
aqueous systems and bind to complementary nucleic acid sequences with high avidity 
and sequence discrimination ability. 



BACKGROUND OF THE INVENTION AND PRIOR ART 



Gene targeted drugs are designed with oligonucleotides of nucleobase sequence of 
10-20 units, complementary to the regulatory region of the disease associated target gene. 
Upon administration, they bind to specific promoter targets (DNA) and block access to 
RNA polymerase and, as a consequence, no mRNA or the corresponding gene products 
are produced (Agarwal, S. in Applied antisense oligonucleotide technology, Stein, C.A. 
& Kreig, A.M. Ed., 1978, Wiley-Liss Inc). Depending on the target chosen, the drugs can 
be developed for application as antiviral or anticancer agents, or in other instances where 
conventional drugs have serious limitations. They are also useful for research and in 
diagnostics for detection and isolation of specific nucleic acids. Since natural 
oligonucleotides are degraded by nucleases, there is considerable interest in synthetic 
analogues of oligonucleotides that are stable under physiological conditions (Uhlmann, E. 
and Peyman, A. Chem. Rev. 1990, 90, 543; Leyten, I., Herdewijn, P., Eur. J. Chem., 1998, 
33, 515). In order to achieve this, a large number of variations in polynucleotide 
backbones have been undertaken, although so far not with the desired results. The great 
majority of these backbone modifications led to decreased stability for hybrids formed 
between the modified oligonucleotide and its complementary natural oligonucleotide, 
except in the case of aminoethylglycyl peptide nucleic acids (aegPNA I) (Nielsen, P. E. 
et al. Science 1991, 254, 1497). 

In these compounds, a linear amide chain composed of repeating aminoethylglycine 
units to which nucleobase ligands are attached replaces the cyclic sugar-phosphate 
backbone of DNA. The compounds are amenable to easy preparation by solid phase peptide 
synthesis. PNAs can be used to target ss or ds DNA to produce gene regulatory molecules 
and as hybridization probes. Polypyrimidine PNAs form triple helices with ds DNA and 
are useful as antigene agents. Triplex formation and strand displacement are the only 
known principles in the art for sequence-specific recognition of ds DNA, with strand 
displacement being superior to the former. This is because, unlike triplex formation, it is 
not restricted to homopurine-homopyrimidine sequences and allows the recognition of 
any sequence by Watson-Crick base pairing of the four natural nucleobases. The strand 
invasion of double strand DNA by unmodified PNAs is only possible in the absence of 
salts or at ionic strengths much lower than the physiological conditions (Hyrup, B. and 
Nielsen, P. E., BioMed Chem. Lett. 1996, 4, 5). 

Although PNA molecules form stable hybrids with complementary DNA, they are 
poorly soluble in aqueous media and, being non-chiral, bind to complementary DNA in 
both parallel and antiparallel directions whenever the sequence permits (Hanvey, J. C. et 
al. Science 1992, 258, 1481; Uhlmann, E. et al. Angew. Chem. Int. Ed. Engl. 1998, 37, 
2796). Several structural variations within such a PNA backbone have been sought in 
order to improve the water solubility, orientation specificity and DNA double strand 
invasion at physiological conditions, without compromising the DNA affinity and 
sequence specificity. Typically, these involve conjugation with positively charged ligands 
such as lysine (Kim, S. H. et al. J. Am. Chem. Soc. 1993, 115, 6477; Koch, T. et al. 
Tetrahedron Lett. 1995, 36, 6933; Ishihara, T. and Corey, D. R. J. Am. Chem. Soc. 1999, 
121, 2012) and introduction of substituents in the ethylenediamine or glycine sectors of 
the backbone (Haaima, G. et al. Angew. Chem. Int. Ed. Engl. 1996, 35, 1939; 
Gangamani, B. P. et al. Tetrahedron 1999, 55, 177; D'Costa, M. et al. Organic Lett. 1999, 
1, 1513). However, it is well understood in the art of PNA backbone modification 
that no PNA analogs have yet emerged in which these problems have been simultaneously 
and satisfactorily addressed (Ganesh and Nielsen, Curr. Organic Chemistry, 2000, 9, 
916). The following relevant patents on PNA are cited here. 

No.  Patent No.   Authors                     Title

1    WO 9964449   Harriot, P.; Nelson, J.;    New cell permeable signal peptides useful for
                  Wallace, A.                 intracellular delivery of molecules

2    WO 9816550   Lowe, G.                    New proline based chiral peptide nucleic acid
                                              compounds having strong hybridisation activity,
                                              useful as antisense, antigene agents or in
                                              molecular biology

3    WO 9714793   Demidov, V. V.;             Nucleic acid clamp comprising two sequences of
                  Frank-Kamenetskii, M. D.;   peptide nucleic acids connected by flexible linker
                  Veselkov, A. G.             with stabilizer at ends, used e.g., to cleave target
                                              nucleic acid

4    WO 9620212   Buchardt, O.;               New peptide nucleic acid monomers with chiral
                  Lagriffoul, P.;             backbone, may be used in preparation of
                  Nielsen, P. E.              oligomers useful e.g., as research tools



OBJECTS OF THE INVENTION 

It is the primary object of the present invention to provide novel cyclic, chiral 
PNAs - homo or hetero oligomers derived from the cyclic prolyl monomers that bind 
ssDNA and RNA strands to form stable hybrids with much higher affinity than the parent 
peptide nucleic acids. 



It is one object of the present invention to provide cyclic monomers for peptide 
nucleic acid oligomers that bind single and double strand DNAs to form stable hybrids 
with higher affinity than the corresponding complexes of the standard peptide nucleic 
acids I, composed of linear monomer units, with DNA. 

It is another object of the invention to provide monomers that are both cyclic and 
chiral for making oligomers that bind one strand of a double stranded polynucleotide in a 
sequence and orientation specific manner. 

It is yet another object of the invention to provide cyclic, chiral monomers that 
confer positive charge(s) on the derived oligomers and make them more soluble in water 
than the standard peptide nucleic acids. 

It is a further object to provide synthetic procedures for cyclic, chiral peptide 
nucleic acid monomers needed for the oligomers of the invention. 

SUMMARY OF THE INVENTION 

In summary, the invention provides novel entities viz., cyclic, chiral, positively 
charged oligomers that have specific advantages of higher solubility in water than 
standard PNA and bind to complementary DNA sequences in a sequence and orientation 
specific manner. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides a novel class of compounds (aepPNA II) 
incorporating cyclic monomers based on aminoethylprolyl (aepPNA) units that bind 
specifically to complementary ss and ds DNA and RNA strands. The compounds of the 
invention generally comprise ligands B (natural nucleobases thymine, cytosine, adenine 
or guanine, inosine, uracil and their synthetic analogs) linked to cyclic monomeric units 
that are an integral part of the peptide backbone. Further, the presence of two chiral 
carbon atoms (denoted by *) leads to four possible diastereomers and the compounds of 
the invention comprise all these stereomers. 

In a general embodiment, the oligomers of the invention are either homopolymers 
aepPNA II of the type 

[Structure of aepPNA II] 

- wherein m and n are 1 to 2 and x = 1-20, 

- each of B^1-B^z is independently selected from the group consisting of H, HO, NH2, 
naturally occurring nucleobases adenine (A), thymine (T), cytosine (C) and guanine 
(G), non-naturally occurring nucleobases, DNA intercalators, heterocyclic moieties 
and reporter ligands, 

- each chiral monomeric unit is independently selected from the four possible 
diastereomers, 

- R1 = H/fluorophore/biotin, R2 = OH / NH(CH2)2COOH / 
NH(CH2)3NH(CH2)4NH(CH2)3NH2. 

Or heteropolymers aepPNA III involving one or more substitutions of the aeg 
monomeric unit in the aepPNA II, as below: 

[Structure of aepPNA III] 

Wherein 

- a, b, c, d, m, n are integers with values in the range 1 to 10 and various combinations 
thereof, 

- R1 is H/COCH3 or L (corresponding to a fluorophore, e.g. dansyl, carboxyfluorescein), 

- R2 is OH, NH2, NHCH2CH2COOH, or sperminyl, i.e., NH(CH2)3NH(CH2)4NH(CH2)3NH2. 

In the following preferred embodiments, the compounds of the invention have m=n=1; B^z 
= T; R1 = H; R2 = NH(CH2)2COOH, with 

(i) a=7, b=1, c=d=0, 

(ii) a=c=3, b=d=1, 

(iii) a=b=c=d=1, repeating twice in that order, 

(iv) a=b=c=0, d=8, and 

(v) a=d=0, b=1, c=7-11 and with various combinations of B^z. 

The oligomers of the invention are synthesized by adaptation of standard peptide 
synthesis procedures, either in solution or in solid phase. The preferred monomer 
precursor-synthon, according to the invention, has the structure IV 

[Structure of monomer synthon IV; nucleobase shown as T] 

where R1 = H/tBoc and R2 = OMe/OH. The chiral centers at positions 2 and 4 lead to four 
diastereomers, each synthesized by the process described in the invention (Scheme 1) 
depending upon the stereochemistry of the starting material. 



[Scheme 1. Substituent labels in the scheme: NHBoc; R = H, Me, benzyl; R' = H, benzoyl] 

In all the monomer synthons and peptide nucleic acid analogs, according to the 
invention, the nucleobase is T. In alternative structures, B may be a non-naturally 
occurring nucleobase (nucleobase analog), another base-binding moiety, an aromatic 
moiety, hydroxyl or even hydrogen. The homooligomers aepPNA II, each incorporating 
the above monomers at all positions, or heteropolymers aepPNA III, with inclusion of 
standard peptide nucleic acid monomers in the aepPNA II at one or more sites, are able 
to recognize the complementary polypurine, single stranded DNA, generating 
aepPNA:DNA complexes. Such recognition of DNA, taking place in the range of 8-15 
bases, is of biological interest. 

While the oligomeric compounds of the invention show remarkably enhanced 
binding with complementary DNA as indicated by the increased Tm of hybrids, they also 
have significant discriminating ability as the presence of even a single mismatch in the 
complementary DNA strand would lead to significant destabilization of the complex. The 
present oligomeric compounds of the invention aepPNA II and aepPNA III are also 
more soluble in water than standard PNA I, making them more suitable for biological 
applications. 

Whereas the improved binding of the compounds of the invention should render 
them efficient as antisense agents, it is expected that an extended range of related analogs 
or derivatives, whose structures are mentioned in the invention, may cause strand 
displacement. This should render them efficient antigene agents in inhibition of gene 
expression in cells. 

BRIEF DESCRIPTION OF THE ACCOMPANYING DRAWINGS 

Figure 1: Scheme for synthesis of (N-Boc)-2-aminoethylbromide 

Figure 2: Scheme for N-alkylation of 4R-hydroxy-2S-prolinemethylester 

Figure 3: Scheme for the synthesis of [1-(N-Boc)-2-aminoethyl]-4S-(N3-benzoylthymin-1-yl)-2S-prolinemethylester 

Figure 4: Scheme for N-alkylation of 4R-hydroxy-2R-prolinemethylester 

Figure 5: Scheme for the synthesis of [1-(N-Boc)-2-aminoethyl]-4S-(N3-benzoylthymin-1-yl)-2R-prolinemethylester 

Figure 6: General scheme for ester hydrolysis to obtain protected monomer units 

Figure 7: General scheme for solid phase synthesis of oligomers showing preparation of 
linear unprotected polyamides carrying novel monomers of the invention 

Figure 8: Oligomer sequences comprising the novel monomer of the invention 

Figure 9: Reverse phase HPLC of some new oligomers of the invention 

Figure 10: MALDI-TOF spectra of some new oligomers of the invention 

Figure 11: UV melting profiles 

Figure 12: Gel electrophoresis 

The peptide nucleic acids have been found to be superior to oligonucleotides since 
they have significantly higher affinity for complementary RNA or DNA. Since they are 
amides of non-biological amino acids, they are biostable and resistant to enzymatic 
degradation by proteases and nucleases. However, their properties in terms of solubility, 
cellular uptake and specific orientation in hybridization are not satisfactory and need to 
be improved by novel approaches. 

The main feature of the invention is the structural design, which simultaneously 
installs positive charges and chirality on the main backbone, accompanied by controlled 
conformational restriction, as a strategy to overcome the many limitations of peptide 
nucleic acids. This is achieved by linking the α-carbon of the glycyl component with the 
β-carbon of the side arm carrying the ligand B in peptide nucleic acids I to obtain the 
monomer synthons IV. In the compounds of the invention, the size of the cyclic system is 
determined by independent values for m and n, which vary between 1 and 2. The 
preferred synthons of the invention, with m=n=1, correspond to aminoethylprolyl PNA 
(aepPNA II). An important consequence of such a structural design is the presence of a 
tertiary amino group in the oligomer of the invention, which has a potential for 
protonation at physiological pH, leading to positively charged oligomers that have better 
hybridization properties with the negatively charged complementary DNA and improved 
solubility in aqueous systems. 

In another interesting aspect of the invention, chirality is introduced at 2 centers 
in the cyclic pyrrolidine ring system, resulting in 4 diastereomeric monomers. The 
polymeric oligomers of the invention are derived from these chiral monomers, leading to 
stereoregular (homochiral) or stereo-alternating polymers (heterochiral). Such 
stereocontrolled and stereodefined oligomers derived from the chiral monomers of the 
invention can be used to influence the strand orientations of PNA in PNA: DNA 
complexes and have benefits in terms of unambiguous, orientation dependent 
complementary recognition of DNA by the oligomers of the invention. 
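
As an illustration of the stereochemical bookkeeping described above, the following minimal sketch enumerates the four diastereomers that arise from two stereocenters and distinguishes stereoregular (homochiral) from stereo-alternating (heterochiral) oligomers. The function names and the label format (for example "2S,4S") are assumptions made for this illustration and are not part of the specification.

```python
from itertools import product

# Two stereocenters on the pyrrolidine ring (C2 and C4), each either R or S.
CENTERS = ("2", "4")
DESCRIPTORS = ("R", "S")

def diastereomers():
    """Return one label per R/S combination, e.g. '2S,4S'."""
    return [
        ",".join(position + descriptor for position, descriptor in zip(CENTERS, combo))
        for combo in product(DESCRIPTORS, repeat=len(CENTERS))
    ]

def is_homochiral(oligomer):
    """True when every monomer in the oligomer has the same configuration."""
    return len(set(oligomer)) == 1

monomers = diastereomers()
print(monomers)

# Hypothetical 8-mers built from the labels above (illustration only).
stereoregular = ["2S,4S"] * 8
stereo_alternating = ["2S,4S", "2R,4S"] * 4
print(is_homochiral(stereoregular), is_homochiral(stereo_alternating))
```

The same enumeration extends to any number of stereocenters by lengthening `CENTERS`, giving 2^n labels for n centers.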

In a further aspect of the invention, a novel process for the synthesis of the 
monomers is described (Scheme 1). The route consists of 2 steps starting from 
4-hydroxyprolinemethylester hydrochloride and involves N-alkylation, followed by a 
Mitsunobu reaction. The invention also comprises a process for making the 
(N-Boc)-2-aminoethyl bromide, which is the alkylating agent. The process described in the 
invention is also applicable for the synthesis of the other diastereomers, depending on the 
stereochemistry of the starting material. 

It is observed that oligomers of the present invention overcome some of the 
biochemical / biophysical limitations of peptide nucleic acids I such as low solubility in 
water and relative orientations of strands in duplex and triplex structures. The oligomers 
of the invention are synthesized by a methodology similar to that of PNA, in which the 
monomer synthons are coupled either to the standard PNA monomer or another modified 
monomer of the invention. This is useful in adaptation of the present synthetic set-ups for 
the synthesis of oligomers of the invention. 

General method for the synthesis of monomers 

One monomer synthon, according to the invention, is prepared by N-alkylation of 
4R-hydroxy-2S-prolinemethylester 1 with (N-Boc)-2-aminoethylbromide 2 to afford 
[1-(N-Boc)-2-aminoethyl]-4R-hydroxy-2S-proline derivative 3 (Figure 2). The alkylating 
agent 2 was prepared in two steps from 2-aminoethanol by first making its NH-Boc 
derivative followed by bromination (Figure 1). The C4 hydroxyl function in 3 was 
replaced with N3-benzoylthymine using the Mitsunobu reaction to yield the (2S,4S) ester 4a 
(Figure 3). Similarly, N-alkylation of 4R-hydroxy-2R-prolinemethylester 5 (Figure 4), 
followed by Mitsunobu reaction with N3-benzoylthymine (Figure 5), gave the (2R,4S) 
ester 6a. The (2R,4S) 6a and (2S,4S) 4a esters were hydrolyzed (Figure 6) to provide the 
cyclic monomers 4b and 6b for the oligomer assembly. 

Additional objects, advantages and novel features of the cyclic monomers and of the 
process of the invention will become apparent to those skilled in the art of synthesis. For 
example, the 4-hydroxy groups in the starting prolinemethylesters can be inverted by 
Mitsunobu reaction to obtain the other two diastereomers. Other features of the invention are 
illustrated by the following examples thereof, which are not intended to be limiting. 

General method for synthesis of oligomers of the invention 

Oligomers of the invention were prepared generally in accordance with the 
currently practiced art using solid phase synthesis (Figure 7). Merrifield resin derivatized 
with N-Boc-β-alanine (0.13 meq/g of resin) was treated with a three-fold excess of the 
monomer to be coupled, along with the coupling agents 
diisopropylcarbodiimide and HOBt (0.4 meq each/g resin), in DMF (2 ml/g resin). The 
removal of the Boc protecting group in the product was accomplished by trifluoroacetic 
acid treatment, and the efficiency of the deprotection and the coupling reactions was 
determined by the Kaiser test. After sequential addition of monomers, the product 
oligomer comprising the monomers of the invention was released from the resin by 
treatment with trifluoroacetic acid-trifluoromethane sulphonic acid. The products were 
purified using FPLC, their purity checked by HPLC (Figure 9) with an acetonitrile-water 
(0.1% TFA) gradient and their molecular composition confirmed by MALDI-TOF mass 
spectrometry (Figure 10). The oligomer sequences synthesized by this method are listed 
in Figure 8. 

General method for evaluating the stability of hybrids of compounds under 
invention with complementary polynucleotides 

The oligomers of the invention prepared as above were evaluated for their ability 
to hybridize with complementary oligonucleotides. This was done by monitoring the 
temperature dependent UV absorbance (260 nm) of stoichiometrically mixed solutions of 
PNA oligomers with specific sequences of oligodeoxyribonucleotides. The 
temperatures at the midpoint of the observed sigmoidal transitions (Tm) were taken as a 
measure of the stability of the PNA:DNA hybrids (Figure 11, Table 1). When the 
stoichiometry of PNA:DNA is 2:1, the complexes are present as triplexes. The absence 
of a well-defined transition was considered as evidence for unstable PNA:DNA 
complexes. The results indicated that the presence of even a single subunit of the compounds 
of the invention II increased the stability of the derived hybrids as compared to the 
standard PNA of identical sequence (aegPNA I). An increase in the number of 
monomeric units of the invention further enhanced the stability of the complexes, to the 
extent that the homooligomeric PNA:DNA complex did not dissociate even up to 80°C, 
indicating an extremely high stability. The presence of a single mismatch in the 
complementary oligonucleotide of the PNA:DNA mixture led to very significant 
destabilization of the complex. The formation of complexes was also checked 
independently by gel electrophoresis experiments under non-denaturing conditions 
(Figure 12), in which successful complex formation is accompanied by a retardation in 
the electrophoretic mobility of the corresponding samples. Further, the oligomers and their 
complexes did not precipitate even on prolonged storage, unlike analogous standard 
PNA solutions, suggesting remarkably improved aqueous solubility of the oligomers of 
the invention. These results demonstrate the novel properties of the oligomers composed 
of the preferred monomers of the invention, namely enhanced aqueous solubility 
and stability of the complexes with oligonucleotides, without compromising the sequence 
fidelity. 
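
The Tm determination described above can be sketched numerically. The footnote to Table 1 states that Tm values were taken from peaks in first-derivative plots, so the sketch below locates the maximum of dA/dT on a melting curve. The function names are assumptions, and the sigmoidal curve is synthetic data generated for illustration only; it is not a measurement from this work.

```python
import math

def first_derivative(temps, absorbances):
    """Central-difference dA/dT at interior points, as (temperature, slope) pairs."""
    pairs = []
    for i in range(1, len(temps) - 1):
        slope = (absorbances[i + 1] - absorbances[i - 1]) / (temps[i + 1] - temps[i - 1])
        pairs.append((temps[i], slope))
    return pairs

def melting_temperature(temps, absorbances):
    """Temperature at which dA/dT is largest (peak of the first-derivative plot)."""
    return max(first_derivative(temps, absorbances), key=lambda pair: pair[1])[0]

# Synthetic sigmoidal transition centered at a hypothetical 50 °C (not measured data).
HYPOTHETICAL_TM = 50.0
temps = [10 + 0.5 * i for i in range(151)]  # 10 to 85 °C in 0.5 °C steps
absorbances = [
    0.50 + 0.10 / (1 + math.exp(-(t - HYPOTHETICAL_TM) / 3.0)) for t in temps
]

tm = melting_temperature(temps, absorbances)
print(f"Estimated Tm: {tm:.1f} °C")
```

For noisy experimental traces, smoothing before differentiation would normally be needed; that step is omitted here to keep the sketch short.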

Those skilled in the art will appreciate that numerous changes and modifications 
may be made to the preferred embodiments of the invention and that such changes and 
modifications may be made without departing from the spirit of the invention. It is 
therefore intended that the appended claims cover all such equivalent variations as fall 
within the true spirit and scope of the invention. 



Table 1: UV-Tm (°C) of DNA:PNA complexes* 

No.   DNA:PNA2    VS       VR

1     14:7        44       42 (38)
2     14:8        51       54
3     14:9        >75      >75 (67)
4     14:10       >80      >80 (>74)
5     15:9        nd       nd
6     15:10       nd       nd
7     14:11                43 (38)

No.   DNA:PNA     VS       VR

8     16:12       41
9     16:13                39

*nd: not detected. Buffer: 10 mM sodium phosphate, pH 7.4. Values in brackets: Tm (°C) with 
100 mM NaCl. Tm values are accurate to ±1°C. Experiments were repeated at least thrice 
and the Tm values were obtained from the peaks in the first derivative plots. 

Another embodiment of this invention provides a method of using peptide 
nucleic acid oligomers of the present invention for diagnosing and/or modulating the 
expression of genes in organisms. The modulation includes inhibiting transcription and 
replication of the said gene. 

Yet another embodiment of the present invention provides a process for 
treating disease conditions associated with undesired protein production in an 
organism by using the novel compounds of the invention. 

Still another embodiment of the present invention provides a pharmaceutical 
composition comprising a compound of the present invention along with any other 
pharmaceutically effective agents. 

Additional objects, advantages and novel features of this invention will 
become apparent to those skilled in the art upon examination of the following 
examples thereof, which are not intended to be limiting. 

EXAMPLE 1 

Preparation of the precursor (N-Boc)-2-aminoethanol, needed to prepare the alkylating 
agent. 

To a cooled, stirred solution of 2-aminoethanol (5.9ml, 0.098mol) and sodium 
hydroxide (3.6g, 0.098mol) in water-dioxane (1:1), was added drop-wise Boc-azide 
(10ml, 0.081mol). The reaction was stirred at room temperature overnight. The dioxane 
was then removed under vacuum, and the water layer extracted several times with ethyl 
acetate. The organic layer was dried over sodium sulphate and then, evaporated to 
dryness under vacuum to get the product (N-Boc)-2-aminoethanol (5.37g, 41% yield), 
which was used in further steps without purification. 
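
The reported yield can be cross-checked with a short calculation. This is a minimal sketch which assumes that Boc-azide (read as 0.081 mol) is the limiting reagent and that the product is (N-Boc)-2-aminoethanol, C7H15NO3; the helper names are hypothetical and the atomic masses are standard values.

```python
# Standard atomic masses (g/mol).
ATOMIC_MASS = {"C": 12.011, "H": 1.008, "N": 14.007, "O": 15.999}

def molar_mass(formula):
    """Molar mass from an element-count mapping such as {'C': 7, 'H': 15}."""
    return sum(ATOMIC_MASS[element] * count for element, count in formula.items())

def percent_yield(product_mass_g, product_molar_mass, limiting_mol):
    """Isolated yield as a percentage of the limiting reagent."""
    return 100.0 * (product_mass_g / product_molar_mass) / limiting_mol

# (N-Boc)-2-aminoethanol, C7H15NO3 (assumed product formula).
boc_aminoethanol = molar_mass({"C": 7, "H": 15, "N": 1, "O": 3})

# 5.37 g isolated; Boc-azide (0.081 mol) assumed to be the limiting reagent.
yield_pct = percent_yield(5.37, boc_aminoethanol, 0.081)
print(f"M = {boc_aminoethanol:.1f} g/mol, yield = {yield_pct:.0f}%")
```

Under these assumptions the calculation agrees with the 41% figure quoted in the example.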

EXAMPLE 2 

Preparation of the alkylating agent (N-Boc)-2-aminoethylbromide 2 required for the 
synthesis of compound 3 

To an ice-cooled solution of (N-Boc)-2-aminoethanol (2.5g, 15.5mmol) and 
carbon tetrabromide (7.5g, 22.7mmol) in dry benzene (25ml), was added triphenyl 
phosphine (4.7g, 17.9mmol) in small portions over 10-15 min. After 30 min, the solvent was 
evaporated and the product immediately purified by silica gel column chromatography to 
get (N-Boc)-2-aminoethylbromide (5.53g, 79%). 

1H NMR (CDCl3) δ: 4.95 (br s, 1H, NH), 3.50 (m, 4H, (CH2)2), 1.45 (s, 9H, C(CH3)3) 

EXAMPLE 3 

Alkylation of 4(R)-hydroxy-2(S)-prolinemethylester 1a with 2 to obtain starting material 
[1-(N-Boc)-2-aminoethyl]-4(R)-hydroxy-2(S)-prolinemethylester 3 for the synthesis of 
monomers of the invention 

A mixture of 4(R)-hydroxy-2(S)-prolinemethylester hydrochloride (3.24g, 
17.9mmol), (N-Boc)-2-aminoethylbromide (2.0g, 8.9mmol) and anhydrous potassium 
carbonate (3.69g, 26.7mmol) was stirred together in DMF:acetonitrile (1:1) at room 
temperature for 72 h under argon atmosphere. After completion of the reaction as 
indicated by TLC, solvents were removed in vacuo. The residue was taken in water and 
extracted with ethyl acetate (4 x 30ml). The organic layer was dried over sodium sulphate 
and concentrated to get the crude product, which was purified by silica gel column 
chromatography. The pure product was obtained in 56% yield. 

1H NMR (CDCl3) δ: 5.30 (br s, 1H, NH), 4.40 (m, 1H, H4), 3.70 (s, 3H, OCH3), 3.50 (t, 1H, 
H5), 3.38 (dd, 1H, H5'), 3.10 (dd, 2H, Boc-NH-CH2), 2.70 (br m, 4H, H2, 
Boc-NH-CH2-CH2, OH), 2.50 (dd, 1H, H3), 2.10 (m, 1H, H3'), 1.40 (s, 9H, C(CH3)3) 

EXAMPLE 4 

Alkylation of 4(R)-hydroxy-2(R)-prolinemethylester 1b with 2 to obtain starting material 
[1-(N-Boc)-2-aminoethyl]-4(R)-hydroxy-2(R)-prolinemethylester 5 for the synthesis of 
monomers of the invention 

A mixture of 4(R)-hydroxy-2(R)-prolinemethylester hydrochloride (3.24g, 17.9mmol), 
(N-Boc)-2-aminoethylbromide (2.0g, 8.9mmol) and anhydrous potassium carbonate 
(3.69g, 26.7mmol) was stirred together in DMF:acetonitrile (1:1) at room temperature 
for 72 h under argon atmosphere. After completion of the reaction as indicated by TLC, 
solvents were removed in vacuo. The residue was taken in water and extracted with ethyl 
acetate (4 x 30ml). The organic layer was dried over sodium sulphate and concentrated to 
get the crude product, which was purified by silica gel column chromatography. The pure 
product was obtained in 56% yield from (N-Boc)-2-aminoethyl bromide. 

1H NMR (CDCl3) δ: 5.30 (br s, 1H, NH), 4.25 (br s, 1H, H4), 3.72 (s, 3H, OCH3), 3.37 (m, 
1H, H5), 3.40-3.05 (m, 4H, H5', Boc-NH-CH2, OH), 2.65 (m, 3H, H2, 
Boc-NH-CH2-CH2), 2.40 (m, 1H, H3), 1.92 (dd, 1H, H3'), 1.40 (s, 9H, C(CH3)3). 

13C NMR (CDCl3) δ: 174.9 (COOCH3), 156.1 (COOC(CH3)3), 78.6 (C(CH3)3), 
70.0 (OCH3), 64.6 (C4), 61.6 (C5), 53.9 (C3), 51.9 (C2), 39.0 (Boc-NH-(CH2)2), 
28.3 ((CH3)3). 

EXAMPLE 5 

[1-(N-Boc)-2-aminoethyl]-4(S)-(N3-benzoylthymin-1-yl)-2(S)-pro 

To a stirred solution of [1-(N-Boc)-2-aminoethyl]-4(R)-hydroxy-2(S)-prolinemethylester 
3 (0.9g, 3.2mmol), N3-benzoylthymine (0.74g, 3.2mmol) and 
triphenyl phosphine (1g, 3.8 mmol) in dry THF (10ml) at room temperature, was added 
dropwise diisopropylazodicarboxylate (DIAD, 0.76ml, 3.8mmol). After completion of 
the reaction as indicated by TLC (24h), the solvent was removed in vacuo and the residue 
purified by silica gel column chromatography to get the pure product (0.6g, 38%). 

1H NMR (CDCl3) δ: 8.10 (s, 1H, T-H6), 7.90 (d, 2H, Bz-o-CH), 7.65 (t, 1H, Bz-p-CH), 
7.50 (t, 2H, Bz-m-CH), 5.25 (m, 2H, Boc-NH, H4), 3.80 (s, 3H, OCH3), 3.25 (m, 4H, 
Boc-NH-CH2, H5, H5'), 2.80 (m, 3H, Boc-NH-CH2-CH2, H2), 2.60 (m, 1H, H3), 2.00 (s, 3H, 
T-CH3), 1.95 (m, 1H, H3'), 1.45 (s, 9H, (CH3)3) 

EXAMPLE 6 

[1-(N-Boc)-2-aminoethyl]-4(S)-(N3-benzoylthymin-1-yl)-2(R)-prolinem 

To a stirred solution of 1-(N-Boc-aminoethyl)-4(R)-hydroxy-2(R)-prolinemethylester 
5 (0.9g, 3.2mmol), N3-benzoylthymine (0.74g, 3.2mmol) and 
triphenyl phosphine (1g, 3.8 mmol) in dry THF (10ml) at room temperature, was added 
dropwise diisopropylazodicarboxylate (DIAD, 0.76ml, 3.8mmol). After completion of 
the reaction as indicated by TLC (24h), the solvent was removed in vacuo and the residue 
purified by silica gel column chromatography to get the pure product (0.47g, 30%). 

1H NMR (CDCl3) δ: 7.92 (d, 2H, Bz-o-CH), 7.65 (t, 1H, Bz-p-CH), 7.50 (t, 2H, Bz-m-CH), 
7.40 (s, 1H, T-H6), 5.25 (m, 1H, H4), 4.90 (br t, 1H, Boc-NH), 3.90 (dd, 1H, H5), 3.72 (s, 
3H, OCH3), 3.30 (m, 3H, H5', Boc-NH-CH2), 2.92 (dd, 1H, H2), 2.78 (t, 2H, 
Boc-NH-CH2-CH2), 2.55 (m, 1H, H3), 2.20 (m, 1H, H3'), 2.00 (s, 3H, T-CH3), 1.45 (s, 9H, C(CH3)3) 

13C NMR (CDCl3) δ: 172.5 (COOCH3), 169.0 (COOC(CH3)3), 162.5 (T-C2), 155.8 (Ph-CO), 149.5 (T-C4), 
137.5 (T-C6), 134.8, 131.3, 130.0 & 128.9 (Ph), 111.1 (T-C5), 78.8 (C(CH3)3), 63.1 (C4), 55.5 (Boc-NH-CH2), 
53.7 (C2), 51.4 (OCH3), 50.6 (C5), 38.7 (Boc-NH-CH2-CH2), 35.1 (C3), 28.1 (C(CH3)3), 12.3 (T-CH3) 

EXAMPLE 7 

Solid phase synthesis, deprotection and purification of the oligomers 7-13 (Figure 8) of 
the invention 

The oligomers of the invention were synthesized on Merrifield resin (100 mg, 
pre-swollen in dichloromethane) derivatized with β-alanine as a spacer chain (0.13 meq/g of 
resin). Solid phase synthesis was carried out employing the well known Boc protection 
strategy involving successive cycles of (i) Boc deprotection using TFA-DCM (1:1, 2 mL, 
20 min), (ii) neutralization of the resulting TFA salt by washing with 5% DIPEA in 
DCM (2 mL, 5 min) and (iii) coupling of the desired aep/aegPNA monomer in DMF using 
HOBt and diisopropylcarbodiimide as the coupling agents (2-5 h). At the end, the 
oligomers were cleaved from the solid support by TFMSA/TFA to obtain oligomers with 
β-alanine as a free carboxylic acid at the C-terminus. The oligomers 7-13 were purified 
initially by gel filtration over Sephadex G25 and later by reverse phase FPLC on a C-8 
column, eluting with an ascending gradient of water-acetonitrile containing 0.1% TFA. 
The purity was checked by reverse phase HPLC on a C-18 column and the molecular 
composition confirmed by MALDI-TOF mass spectrometric analysis. 
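
The reagent quantities implied by the resin loading can be worked out as follows. This is a minimal sketch using the 100 mg resin charge and 0.13 meq/g loading from this example and the three-fold monomer excess from the general method; the function names and the 98% per-step coupling efficiency are hypothetical values chosen for illustration and are not reported in this work.

```python
def resin_sites_mmol(resin_mg, loading_meq_per_g):
    """Reactive sites on the resin in mmol (1 meq = 1 mmol for a monovalent site)."""
    return resin_mg / 1000.0 * loading_meq_per_g

def monomer_needed_mmol(sites_mmol, excess=3.0):
    """Monomer required per coupling cycle at the given molar excess."""
    return sites_mmol * excess

def overall_yield(per_step_efficiency, n_couplings):
    """Fraction of full-length chains after n sequential couplings."""
    return per_step_efficiency ** n_couplings

sites = resin_sites_mmol(100, 0.13)
monomer = monomer_needed_mmol(sites)
print(f"{sites:.3f} mmol sites, {monomer:.3f} mmol monomer per coupling")

# Hypothetical 98% efficiency per step over 8 couplings (illustration only).
print(f"Full-length fraction: {overall_yield(0.98, 8):.1%}")
```

The last function shows why each coupling is monitored (here by the Kaiser test): small per-step losses compound over the length of the oligomer.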

EXAMPLE 8 

DNA hybridization studies with the oligomers of the invention : 

The concentrations of the oligonucleotides (14-16, Figure 8) and the oligomers of the 
invention were determined from the absorbance at 260 nm, using the molar extinction 
coefficients A, 15.4 cm²/µmol; T, 8.8 cm²/µmol; G, 11.7 cm²/µmol; C, 7.3 cm²/µmol. 

All Tm measurements were carried out on a Perkin-Elmer Lambda UV spectrometer 
equipped with a multiple cell holder and variable temperature accessory. A 
programmable Julabo water circulator controlled the temperature. The samples were 
prepared by mixing calculated amounts of stock oligonucleotide and the oligomer of the 
invention together with calculated amounts of NaCl and sodium 
phosphate, pH 7.0, to a final volume of 2 mL in a sample cuvette. The samples were 
annealed by heating at 85°C for 5 min, followed by slow cooling to 5°C. The OD260 was 
then recorded in steps from 10-85°C, with a temperature increment of 0.5°C/min. The 
results were plotted as percent hyperchromicity at 260 nm as a function of temperature. 
Analyses of the data were performed using Microcal Origin software. The melting 
temperatures were determined from the midpoints of the percent hyperchromicity at 
260 nm vs. temperature plots. 
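
The concentration determination in this example follows the Beer-Lambert law. The sketch below assumes that the extinction coefficient of an oligomer is the simple sum of the per-base values given above, with no nearest-neighbor or hypochromicity correction, and that 1 cm²/µmol equals 1000 M^-1 cm^-1. The function names, the T8 test sequence and the absorbance reading are hypothetical.

```python
# Per-base molar extinction coefficients at 260 nm from the text, in cm^2/µmol.
# 1 cm^2/µmol corresponds to 1000 L mol^-1 cm^-1.
EPSILON_260 = {"A": 15.4, "T": 8.8, "G": 11.7, "C": 7.3}

def oligomer_epsilon(sequence):
    """Sum of per-base coefficients, returned in M^-1 cm^-1 (additive assumption)."""
    return 1000.0 * sum(EPSILON_260[base] for base in sequence.upper())

def concentration_micromolar(a260, sequence, path_cm=1.0):
    """Beer-Lambert law: c = A / (epsilon * l), converted to µM."""
    return 1e6 * a260 / (oligomer_epsilon(sequence) * path_cm)

# Hypothetical reading for an all-thymine 8-mer in a 1 cm cuvette.
sequence = "TTTTTTTT"
epsilon = oligomer_epsilon(sequence)
conc = concentration_micromolar(0.352, sequence)
print(f"epsilon = {epsilon:.0f} M^-1 cm^-1, concentration = {conc:.2f} µM")
```

Stock solutions quantified this way can then be mixed in the 1:1 or 2:1 ratios needed for the duplex and triplex melting experiments.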

EXAMPLE 9 

Gel electrophoretic experiments 

The samples were prepared by mixing stock solutions of the oligomers of the 
invention and oligonucleotides in tris-EDTA buffer, pH 7.0. These were annealed by 
heating at 85°C for 5 min., followed by slow cooling to 10°C. The samples were 
subjected to gel electrophoresis using 15% PAGE (acrylamide:bisacrylamide, 29:1) gels 
1 mm thick and 15 cm long. Electrophoresis was carried out at a constant voltage of 
150 V until the marker bromophenol dye had migrated 12 cm along the gel. The bands 
were visualized by illuminating the gel (254 nm) after placing it on a fluorescent thin- 
layer silica gel chromatographic plate. 



SEQUENCE LISTING 



(1) GENERAL INFORMATION 

(i) APPLICANT: 

(ii) TITLE OF INVENTION 

(iii) NUMBER OF SEQUENCES 

(iv) CORRESPONDENCE ADDRESS 

(v) COMPUTER READABLE FORM 

(A) MEDIUM TYPE: Floppy Disk 

(B) COMPUTER: IBM PC Compatible 

(C) OPERATING SYSTEM: MS-DOS 

(D) SOFTWARE: Microsoft Word 

(vi) CURRENT APPLICATION DATA 

(vii) ATTORNEY / AGENT INFORMATION 

(viii) TELECOMMUNICATION INFORMATION 

(2) INFORMATION FOR SEQ ID NO: 1 : 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 8 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 
(B) LOCATION: 5 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached at C4 of N1-(2-aminoethyl) 
proline through position N1 of the thymine." 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa β-Ala 
1 5 8 

(2) INFORMATION FOR SEQ ID NO:2: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 8 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Unknown 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 
/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached at C4 of N1-(2-aminoethyl) 
proline through position N1 of the thymine." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached at C4 of N1-(2-aminoethyl) 
proline through position N1 of the thymine." 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa β-Ala 
1 5 8 

(2) INFORMATION FOR SEQ ID NO:3: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Unknown 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 
(A) NAME / KEY : Modified site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached at C4 of N1-(2-aminoethyl) 
proline through position N1 of the thymine." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached at C4 of N1-(2-aminoethyl) 
proline through position N1 of the thymine." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached at C4 of N1-(2-aminoethyl) 
proline through position N1 of the thymine." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached at C4 of N1-(2-aminoethyl) 
proline through position N1 of the thymine." 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa β-Ala 
1 5 8 

(2) INFORMATION FOR SEQ ID NO:4: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 8 

(B) TYPE: amino acid 
(C) STRANDEDNESS: single 

(D) TOPOLOGY: Unknown 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached at C4 of N1-(2-aminoethyl) 
proline through position N1 of the thymine." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached at C4 of N1-(2-aminoethyl) 
proline through position N1 of the thymine." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached at C4 of N1-(2-aminoethyl) 
proline through position N1 of the thymine." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached at C4 of N1-(2-aminoethyl) 
proline through position N1 of the thymine." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached at C4 of N1-(2-aminoethyl) 
proline through position N1 of the thymine." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached at C4 of N1-(2-aminoethyl) 
proline through position N1 of the thymine." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached at C4 of N1-(2-aminoethyl) 
proline through position N1 of the thymine." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 
/note = " Thymine heterocycle is attached at C4 of N1-(2-aminoethyl) 
proline through position N1 of the thymine." 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa β-Ala 
1 5 8 

(2) INFORMATION FOR SEQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 8 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Unknown 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 
/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa β-Ala 
1 5 8 

(2) INFORMATION FOR SEQ ID NO:6: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 12 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Unknown 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Adenine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 9 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 
(B) LOCATION: 5 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Adenine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 9 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Adenine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 9 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 10 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 11 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Adenine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 9 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa-β-Ala 
1 5 10 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 12 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Unknown 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Adenine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 9 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Adenine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 9 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 
(A) NAME / KEY : Modified site 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Adenine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 9 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 10 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 11 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Adenine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 9 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached at C4 of N1-(2-aminoethyl) 
proline through position N1 of the thymine." 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa-β-Ala 
1 5 10 

(2) INFORMATION FOR SEQ ID NO:8: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 12 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Unknown 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached at C4 of N1-(2-aminoethyl) 
proline through position N1 of the thymine." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 2 
(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = "Adenine heterocycle is attached at C4 of N1-(2-aminoethyl) 
proline through position N9 of the adenine." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached at C4 of N1-(2-aminoethyl) 
proline through position N1 of the thymine." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = "Adenine heterocycle is attached at C4 of N1-(2-aminoethyl) 
proline through position N9 of the adenine." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached at C4 of N1-(2-aminoethyl) 
proline through position N1 of the thymine." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached at C4 of N1-(2-aminoethyl) 
proline through position N1 of the thymine." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = "Adenine heterocycle is attached at C4 of N1-(2-aminoethyl) 
proline through position N9 of the adenine." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached at C4 of N1-(2-aminoethyl) 
proline through position N1 of the thymine." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 10 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = "Adenine heterocycle is attached at C4 of N1-(2-aminoethyl) 
proline through position N9 of the adenine." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 11 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached at C4 of N1-(2-aminoethyl) 
proline through position N1 of the thymine." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N1-(2-aminoethyl) 
proline at C4 of proline through position 1 of the heterocycle." 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa β-Ala 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 12 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Unknown 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = "Cytosine heterocycle is attached at C4 of N1-(2-aminoethyl) 
proline through position N1 of the cytosine." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 
/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Cytosine heterocycle is attached at C4 of N1-(2-aminoethyl) 
proline through position N1 of the cytosine." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = "Adenine heterocycle is attached at C4 of N1-(2-aminoethyl) 
proline through position N9 of the adenine." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 10 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Cytosine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 1 1 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = "Cytosine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 13 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 14 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 15 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa β-Ala 

1 5 10 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 12 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Unknown 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = "Guanine heterocycle is attached at C4 of N1-(2-aminoethyl) 
proline through position N9 of the guanine." 
(ix) FEATURE: 



(A) NAME / KEY : Modified site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Guanine heterocycle is attached at C4 of N1-(2-aminoethyl) 
proline through position N1 of the guanine." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = "Adenine heterocycle is attached at C4 of N1-(2-aminoethyl) 
proline through position N9 of the adenine." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 10 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Cytosine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 11 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = "Cytosine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 13 
(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 14 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 15 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa β-Ala 

1 5 10 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Unknown 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Guanine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 9 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = "Thymine heterocycle is attached at C4 of N1-(2-aminoethyl) 
proline through position N1 of the thymine." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Adenine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 9 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 4 
(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Guanine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 9 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Adenine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 9 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = "Thymine heterocycle is attached at C4 of N1-(2-aminoethyl) 
proline through position N1 of the thymine." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Cytosine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Adenine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 7 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Cytosine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 10 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = "Thymine heterocycle is attached at C4 of N1-(2-aminoethyl) 
proline through position N1 of the thymine." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa β-Ala 
1 5 8 

INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Unknown 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Guanine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 9 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = "Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Adenine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 9 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = "Guanine heterocycle is attached at C4 of N1-(2-aminoethyl) 
proline through position N1 of the guanine." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Adenine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 9 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Cytosine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Adenine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 7 of the heterocycle." 
(ix) FEATURE: 
(A) NAME / KEY : Modified site 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = " Cytosine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 
(ix) FEATURE: 

(A) NAME / KEY : Modified site 

(B) LOCATION: 10 

(D) OTHER INFORMATION: /label = MODIFIED-SITE 

/note = "Thymine heterocycle is attached to N-acetyl (2-aminoethyl) 
glycine through the N-acetyl group at position 1 of the heterocycle." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa β-Ala 
1 5 8 
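
As an illustrative aid only (it is not part of the sequence listing), the per-position FEATURE annotations of SEQ ID NO: 12 and SEQ ID NO: 13 can be condensed into the one-letter shorthand used in the figure legends, where an upper-case letter denotes an (2-aminoethyl)glycine (aeg) unit and a lower-case letter an (2-aminoethyl)proline (aep) unit. The Python sketch below assumes that this convention extends to all four bases (the legends define it only for A/T and t); the names SEQ_12, SEQ_13 and compact are hypothetical.

```python
# Per-position (base, backbone) pairs transcribed from the FEATURE entries
# of SEQ ID NO: 12 and SEQ ID NO: 13 (LOCATION 1 through 10).
# "aeg" = N-acetyl (2-aminoethyl)glycine unit, "aep" = N1-(2-aminoethyl)proline unit.
SEQ_12 = [("G", "aeg"), ("T", "aep"), ("A", "aeg"), ("G", "aeg"), ("A", "aeg"),
          ("T", "aep"), ("C", "aeg"), ("A", "aeg"), ("C", "aeg"), ("T", "aep")]
SEQ_13 = [("G", "aeg"), ("T", "aeg"), ("A", "aeg"), ("G", "aep"), ("A", "aeg"),
          ("T", "aeg"), ("C", "aeg"), ("A", "aeg"), ("C", "aeg"), ("T", "aeg")]


def compact(units):
    """Return the shorthand string: upper case for aeg units, lower case for aep units.

    Assumption: the lower-case convention is applied to every base, not only T.
    """
    return "".join(base if backbone == "aeg" else base.lower()
                   for base, backbone in units)


if __name__ == "__main__":
    print(compact(SEQ_12))
    print(compact(SEQ_13))
```

Keeping the base and the backbone type as separate fields makes it straightforward to count or locate the aep units in an oligomer without re-reading the individual notes.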



CLAIMS: 



1. Novel chiral peptide nucleic acid oligomers having the formula: 



[Structural formula: aepPNA II] 



Wherein, 

- m and n are 1 to 2 and x = 1-20; 

- each of B1-B2 is independently selected from the group consisting of H, HO, NH2, 
naturally occurring nucleobases adenine (A), thymine (T), cytosine (C) and guanine 
(G), non-naturally occurring nucleobases, DNA intercalators, heterocyclic moieties 
and reporter ligands; 

- each chiral monomeric unit is independently selected from the four possible 
diastereomers; and 

- R1 = H/fluorophore/biotin; R2 = OH/NH(CH2)2COOH/ 
NH(CH2)3NH(CH2)4NH(CH2)3NH2. 

2. Novel chiral peptide nucleic acid oligomers having the formula: 

[Structural formula: aepPNA III] 

which are heteropolymeric aepPNA III (with all four possible diastereomers) involving 
one or more substitutions of the non-chiral aeg unit of aminoethylglycyl PNA I in 
aepPNA II as below: 

- each chiral monomer unit is independently selected from the four possible diastereomers, 

- a, b, c, d, m, n are integers with independent values in the range 1 to 10 and various 
combinations thereof, 

- R1 is H/COCH3 or L (corresponding to a fluorophore, e.g. dansyl, carboxyfluorescein), 

- R2 is OH, NH2, NHCH2CH2COOH, or sperminyl, i.e., NH(CH2)3NH(CH2)4NH(CH2)3NH2, 
and 

- each of B1-B2 is independently selected from the group consisting of H, HO, NH2, 
naturally occurring nucleobases, non-naturally occurring nucleobases, DNA 
intercalators, heterocyclic moieties and reporter ligands. 

3. Novel chiral peptide nucleic acid oligomers as claimed in claim 2, wherein m=n=1; 
B z = T; R1=H; R2 = NH(CH2)2COOH, with 

i. a=7, b=1, c=d=0, 

ii. a=c=3, b=d=1, 

iii. a=b=c=d=1, repeating twice in that order, 

iv. a=b=c=0, d=8, 

v. a=d=0, b=1, c=7-11, and with various combinations of B z. 

4. A novel chiral peptide nucleic acid as claimed in claims 1 and 2, wherein the 
oligomers are synthesized by adaptation of standard peptide synthesis procedures, 
either in solution or in solid phase. 

5. A monomer precursor-synthon having the formula IV 



[Structural formula IV] 
Wherein, 

R1 = H/Boc/Fmoc, R2 = OMe/OH/OEt/O-benzyl, 

variation of chirality at positions 2 and 4 leading to four diastereomers (2S,4R), 
(2R,4S), (2S,4S) and (2R,4R), and 
T is the nucleobase. 
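
For illustration only, the count of four diastereomers follows from two stereocentres (ring positions 2 and 4), each of which may be R or S. The short Python sketch below enumerates the combinations; the function name diastereomers is hypothetical and the sketch is not part of the claimed subject matter.

```python
from itertools import product


def diastereomers(centres=(2, 4)):
    """List every R/S assignment over the given stereocentre positions."""
    return [tuple(f"{pos}{cfg}" for pos, cfg in zip(centres, combo))
            for combo in product("SR", repeat=len(centres))]


if __name__ == "__main__":
    for isomer in diastereomers():
        # Each line has the form (2S,4R), matching the notation used in claim 5.
        print("(" + ",".join(isomer) + ")")
```

With two centres the function yields 2 x 2 = 4 assignments, the same four configurations recited above.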

6. A monomer precursor-synthon as claimed in claim 5, wherein T is a naturally 
occurring nucleobase. 

7. A process for preparing compounds according to claim 5, comprising the steps of: 

A. providing the alkylating reagent (N-Boc)-2-aminoethylbromide (2) in two steps 
from 2-aminoethanol; 

B. providing N-alkylation of 4-hydroxyprolinemethylester with the reagent prepared as 
in claim 7A: 

- alkylation of 4R-hydroxy-2S-prolinemethylester (1a) with (N-Boc)-2-aminoethyl 
bromide (2) to afford [1-(N-Boc)-2-aminoethyl]-4R-hydroxy-2S-prolinemethyl 
ester (3), 

- alkylation of 4R-hydroxy-2R-prolinemethylester (1b) with (N-Boc)-2-aminoethyl 
bromide (2) to afford [1-(N-Boc)-2-aminoethyl]-4R-hydroxy-2R-prolinemethyl 
ester (5), 

- alkylation of 4S-hydroxy-2R-prolinemethylester with (N-Boc)-2-aminoethyl 
bromide (2) to afford [1-(N-Boc)-2-aminoethyl]-4S-hydroxy-2R-proline 
methylester, 

- alkylation of 4S-hydroxy-2S-prolinemethylester with (N-Boc)-2-aminoethyl 
bromide (2) to afford [1-(N-Boc)-2-aminoethyl]-4S-hydroxy-2S-proline 
methylester; 

C. producing monomer synthons (4a) and (6a) by Mitsunobu reaction of compounds 
prepared according to claim 7B with N3-benzoylthymine. 
8. A process for introducing novel chiral monomers as claimed in claim 7C at 
specific/desired position(s) in the oligomers of desired sequences. 

9. A process for sequence specific recognition of a single or double stranded 
polynucleotide (DNA, RNA), by the oligomers as per claims 1 and 2 derived from the 
compounds according to claim 7. 

10. A method of using peptide nucleic acid oligomers as claimed in claim 9 for 
diagnosing and/or modulating the expression of genes in organisms. 

11. A method as claimed in claim 10 wherein said modulation includes inhibiting 
transcription and replication of the said gene. 

12. A process for treating disease conditions associated with undesired protein production 
in an organism by using the compound according to claims 1 and 2. 

13. A pharmaceutical composition comprising a compound according to claims 1 and 2 
along with any other pharmaceutically effective agents. 
ABSTRACT 



A novel class of peptide nucleic acid compounds is disclosed. These positively charged, 
chiral analogs, based on cyclic monomeric units, bind to complementary ssDNA, dsDNA and 
RNA strands with higher affinity than the corresponding achiral, linear peptide nucleic 
acids, without compromising the sequence specificity of binding. The compounds of this 
invention are also more soluble in water than the standard peptide nucleic acids. 
Figure 7. Solid phase synthesis of oligomers 

Figure 8. Oligomer sequences comprising novel monomers of the invention 



Oligomer sequences 
A/T = aegPNA-A/T, t = aepPNA-T 

DNA sequences 



14 5'- G C A A A A A A A A C G -3' 

15 5'- G C A A A T A A A A C G -3' 

16 5'- A A T A A T A A T A T A -3' 
FIG. 9 

FIGURE 10 A. MALDI-TOF mass spectrum of PNA 7 

Figure 10 B. MALDI-TOF mass spectrum of PNA 8 and 9 

FIGURE 10 A. MALDI-TOF mass spectrum of PNA [number illegible] 

Figure 10 B. MALDI-TOF mass spectrum of PNA [numbers illegible] 

Figure 11. UV melting profiles 

FIGURE 12. Gel electrophoresis 




15% Polyacrylamide Gel Electrophoresis (acrylamide:bis-acrylamide, 29:1) of 
aepPNA:DNA complexes. Bands were visualized by UV-shadowing, i.e., by illuminating 
the gel placed on a fluorescent thin-layer silica gel chromatographic plate, F254, 20 cm x 
20 cm, using UV light. Lane 1: 7; Lane 2: (8: DNA 14); Lane 3: (7: DNA 14); Lane 4: 
DNA 14; Lane 5: DNA 15; Lane 6: (8: DNA 14); Lane 7: 8; Lane 8: (8: DNA 15). 



Oligomer sequences 

7 H-TTTTTTTt-(β-Ala)-OH 

8 H-TTTtTTTt-(β-Ala)-OH 

T = aegPNA-T, t = aepPNA-T 

DNA sequences 

14 5'- G C A A A A A A A A C G -3' 

15 5'- G C A A A T A A A A C G -3' 
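
To make the difference between the two DNA targets concrete, the following Python sketch (an illustration only; the helper names are hypothetical) locates the position at which DNA 15 differs from DNA 14 and measures the longest uninterrupted run of A in each strand, i.e. the stretch available for pairing with the eight thymine units of oligomers 7 and 8.

```python
from itertools import groupby

# DNA sequences 14 and 15 as listed above, written 5' to 3' without spaces.
DNA_14 = "GCAAAAAAAACG"
DNA_15 = "GCAAATAAAACG"


def mismatch_positions(a, b):
    """Return 1-based positions (counted from the 5' end) where a and b differ."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    return [i + 1 for i, (x, y) in enumerate(zip(a, b)) if x != y]


def longest_run(seq, base="A"):
    """Length of the longest uninterrupted run of `base` in seq (0 if absent)."""
    return max((len(list(group)) for key, group in groupby(seq) if key == base),
               default=0)


if __name__ == "__main__":
    print(mismatch_positions(DNA_14, DNA_15))
    print(longest_run(DNA_14), longest_run(DNA_15))
```

DNA 14 carries an unbroken A8 tract, whereas the single T in DNA 15 interrupts that tract, so DNA 15 serves as a one-base mismatch target for the same oligomers.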